Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data
نویسندگان
چکیده
Prediction of Speech Intelligibility (SI) is a topic interest for most speech processing applications, where intelligibility any importance, e.g., coding, transmission and enhancement. Traditionally, SI predictors have been based on signal methods heuristics, but more recently, an increasing number data-driven SI-predictors proposed. Data-driven prediction requires large quantities labelled data, ideally from many listening tests. Listening tests differ in factors such as vocabulary, talker, listener’s task, etc. collectively referred to the paradigm. A naïve strategy training directly stimuli, pooled different tests, futile because exact map stimulus determined, not only by stimulus, also trained this way become specialized paradigms data erroneously attributing all paradigm influences stimulus. The problem fundamental persists even idealized situation abundant. We propose that independent paradigms, underlying data. proposed concatenate SI-predictor layer trainable dataset-specific mapping functions, each corresponding single These functions are jointly with serve efficiently approximate psychometric implied prevent predictor specializing these during training. present novel architecture incorporates convolutional network ESTOI back-end, train it strategy, compare range existing non-data-driven predictors. results higher performance overall increased robustness unseen paradigms.
منابع مشابه
Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملIntelligibility predictors and neural representation of speech
Intelligibility predictors tell us a great deal about human speech perception, in particular which acoustic factors strongly effect human behavior, and which do not. A particular intelligibility predictor, the Articulation Index (AI), is interesting because it models human behavior in noise, and its form has implications about representation of speech in the brain. Specifically, the Articulatio...
متن کاملthe effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension
کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...
15 صفحه اولPhysiologically Based Predictors of Speech Intelligibility
Introduction Just a century ago, researchers at AT&T’s Western Electric Research Labs (later renamed Bell Labs) began a comprehensive research program to develop an objective predictor of speech intelligibility that would provide a tool for efficiently assessing different speech telecommunications systems (Allen, 1996). This work was largely directed by the first president of the Acoustical Soc...
متن کاملeffect of listening strategy training on iranian efl learners listening comprehension and use of strategies
this study aimed to investigate the effects of listening strategy training on iranian efl learners listening comprehension and use of such strategies. this work, employing an experimental methodology, was conducted among 60 adult efl learners from a language institute in isfahan, iran, as participants. the participants, who were selected based on the results of a placement test, were assigned t...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3184785